Kui Ren

The State Key Laboratory of Blockchain and Data Security, Zhejiang University

ConsisGuard: Aligning Safety Deliberation with Policy Enforcement in LLM Guardrails

May 29, 2026
Via arXiv

RouteScan: A Non-Intrusive Approach to Auditing MoE LLMs Safety via Expert Routing Telemetry

May 24, 2026
Via arXiv

LoopTrap: Termination Poisoning Attacks on LLM Agents

May 07, 2026
Via arXiv

When AI reviews science: Can we trust the referee?

Apr 26, 2026
Via arXiv

JANUS: A Lightweight Framework for Jailbreaking Text-to-Image Models via Distribution Optimization

Mar 22, 2026
Via arXiv

STEP: Detecting Audio Backdoor Attacks via Stability-based Trigger Exposure Profiling

Mar 18, 2026
Via arXiv

When Detectors Forget Forensics: Blocking Semantic Shortcuts for Generalizable AI-Generated Image Detection

Mar 10, 2026
Via arXiv

Towards Cross-lingual Values Assessment: A Consensus-Pluralism Perspective

Feb 19, 2026
Via arXiv

Explainable Token-level Noise Filtering for LLM Fine-tuning Datasets

Feb 16, 2026
Via arXiv

HyperPotter: Spell the Charm of High-Order Interactions in Audio Deepfake Detection

Feb 05, 2026
Via arXiv